Cross-industry Standard Process for Data Mining
Cross-industry standard process for data mining, known as CRISP-DM.
Six stages of CRISP-DM
- Business Understanding
- Data Understanding
- Data Preparation
- Modeling
- Evaluation
- Deployment
Top-down vs. Bottom-up
Data scientists may use either a “top-down” approach or a “bottom-up” approach to data science.
- “Top-down” approach:
- first defining a business problem then analyzing the data to find a solution
- “Bottom-up” approach:
- starting with the data, and then coming up with a business problem based on the data